Overview

Dataset statistics

Number of variables28
Number of observations5043
Missing cells2698
Missing cells (%)1.9%
Duplicate rows45
Duplicate rows (%)0.9%
Total size in memory3.5 MiB
Average record size in memory718.5 B

Variable types

NUM16
CAT11
URL1

Reproduction

Analysis started2020-03-15 19:50:47.175695
Analysis finished2020-03-15 19:52:01.887402
Versionpandas-profiling v2.5.3
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Dataset has 45 (0.9%) duplicate rows Duplicates
director_name has a high cardinality: 2398 distinct values High cardinality
actor_2_name has a high cardinality: 3032 distinct values High cardinality
genres has a high cardinality: 914 distinct values High cardinality
actor_1_name has a high cardinality: 2097 distinct values High cardinality
movie_title has a high cardinality: 4917 distinct values High cardinality
actor_3_name has a high cardinality: 3521 distinct values High cardinality
plot_keywords has a high cardinality: 4760 distinct values High cardinality
country has a high cardinality: 65 distinct values High cardinality
cast_total_facebook_likes is highly correlated with actor_1_facebook_likesHigh Correlation
actor_1_facebook_likes is highly correlated with cast_total_facebook_likesHigh Correlation
director_name has 104 (2.1%) missing values Missing
director_facebook_likes has 104 (2.1%) missing values Missing
gross has 884 (17.5%) missing values Missing
plot_keywords has 153 (3.0%) missing values Missing
content_rating has 303 (6.0%) missing values Missing
budget has 492 (9.8%) missing values Missing
title_year has 108 (2.1%) missing values Missing
aspect_ratio has 329 (6.5%) missing values Missing
budget is highly skewed (γ1 = 48.15743539) Skewed
director_facebook_likes has 907 (18.0%) zeros Zeros
actor_3_facebook_likes has 89 (1.8%) zeros Zeros
facenumber_in_poster has 2152 (42.7%) zeros Zeros
actor_2_facebook_likes has 55 (1.1%) zeros Zeros
movie_facebook_likes has 2181 (43.2%) zeros Zeros

Variables

color
Categorical

Distinct count2
Unique (%)< 0.1%
Missing19
Missing (%)0.4%
Memory size19.8 KiB
Color
4815
Black and White
 
209
ValueCountFrequency (%) 
Color 4815 95.5%
 
Black and White 209 4.1%
 
(Missing) 19 0.4%
 

Length

Max length16
Mean length5.44834424
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 12 75.0%
 
Uppercase_Letter 3 18.8%
 
Space_Separator 1 6.2%
 
ValueCountFrequency (%) 
Latin 15 93.8%
 
Common 1 6.2%
 
ValueCountFrequency (%) 
ASCII 16 100.0%
 

director_name
Categorical

HIGH CARDINALITY
MISSING
Distinct count2398
Unique (%)48.6%
Missing104
Missing (%)2.1%
Memory size19.8 KiB
Steven Spielberg
 
26
Woody Allen
 
22
Clint Eastwood
 
20
Martin Scorsese
 
20
Ridley Scott
 
17
Other values (2393)
4834
ValueCountFrequency (%) 
Steven Spielberg 26 0.5%
 
Woody Allen 22 0.4%
 
Clint Eastwood 20 0.4%
 
Martin Scorsese 20 0.4%
 
Ridley Scott 17 0.3%
 
Steven Soderbergh 16 0.3%
 
Tim Burton 16 0.3%
 
Spike Lee 16 0.3%
 
Renny Harlin 15 0.3%
 
Oliver Stone 14 0.3%
 
Other values (2388) 4757 94.3%
 
(Missing) 104 2.1%
 

Length

Max length32
Mean length12.87685901
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 41 53.9%
 
Uppercase_Letter 31 40.8%
 
Other_Punctuation 2 2.6%
 
Dash_Punctuation 1 1.3%
 
Space_Separator 1 1.3%
 
ValueCountFrequency (%) 
Latin 72 94.7%
 
Common 4 5.3%
 
ValueCountFrequency (%) 
ASCII 56 100.0%
 

num_critic_for_reviews
Real number (ℝ≥0)

Distinct count528
Unique (%)10.6%
Missing50
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean140.1942719807731
Minimum1.0
Maximum813.0
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum1
5-th percentile9
Q150
median110
Q3195
95-th percentile387
Maximum813
Range812
Interquartile range (IQR)145

Descriptive statistics

Standard deviation121.6016754
Coefficient of variation (CV)0.8673797701
Kurtosis2.91341641
Mean140.194272
Median Absolute Deviation (MAD)92.35207408
Skewness1.5165327
Sum699990
Variance14786.96746
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 43 0.9%
 
9 37 0.7%
 
5 36 0.7%
 
10 35 0.7%
 
8 35 0.7%
 
12 34 0.7%
 
81 33 0.7%
 
16 33 0.7%
 
43 31 0.6%
 
29 30 0.6%
 
Other values (518) 4646 92.1%
 
(Missing) 50 1.0%
 
ValueCountFrequency (%) 
1 43 0.9%
 
2 26 0.5%
 
3 24 0.5%
 
4 29 0.6%
 
5 36 0.7%
 
ValueCountFrequency (%) 
813 1 < 0.1%
 
775 1 < 0.1%
 
765 1 < 0.1%
 
750 2 < 0.1%
 
739 1 < 0.1%
 

duration
Real number (ℝ≥0)

Distinct count191
Unique (%)3.8%
Missing15
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean107.2010739856802
Minimum7.0
Maximum511.0
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum7
5-th percentile81
Q193
median103
Q3118
95-th percentile146
Maximum511
Range504
Interquartile range (IQR)25

Descriptive statistics

Standard deviation25.19744081
Coefficient of variation (CV)0.235048399
Kurtosis22.56579716
Mean107.201074
Median Absolute Deviation (MAD)16.81590041
Skewness2.339134041
Sum539007
Variance634.9110233
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
90 161 3.2%
 
100 141 2.8%
 
101 139 2.8%
 
98 135 2.7%
 
97 131 2.6%
 
93 129 2.6%
 
95 124 2.5%
 
99 124 2.5%
 
94 124 2.5%
 
96 113 2.2%
 
Other values (181) 3707 73.5%
 
ValueCountFrequency (%) 
7 2 < 0.1%
 
11 1 < 0.1%
 
14 1 < 0.1%
 
20 1 < 0.1%
 
22 7 0.1%
 
ValueCountFrequency (%) 
511 1 < 0.1%
 
334 1 < 0.1%
 
330 1 < 0.1%
 
325 1 < 0.1%
 
300 1 < 0.1%
 

director_facebook_likes
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count435
Unique (%)8.8%
Missing104
Missing (%)2.1%
Infinite0
Infinite (%)0.0%
Mean686.5092123911724
Minimum0.0
Maximum23000.0
Zeros907
Zeros (%)18.0%
Memory size39.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q17
median49
Q3194.5
95-th percentile973
Maximum23000
Range23000
Interquartile range (IQR)187.5

Descriptive statistics

Standard deviation2813.328607
Coefficient of variation (CV)4.098020181
Kurtosis27.25628935
Mean686.5092124
Median Absolute Deviation (MAD)1069.818414
Skewness5.22970117
Sum3390669
Variance7914817.85
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 907 18.0%
 
3 70 1.4%
 
6 66 1.3%
 
7 64 1.3%
 
2 63 1.2%
 
4 60 1.2%
 
11 59 1.2%
 
10 53 1.1%
 
8 52 1.0%
 
5 52 1.0%
 
Other values (425) 3493 69.3%
 
(Missing) 104 2.1%
 
ValueCountFrequency (%) 
0 907 18.0%
 
2 63 1.2%
 
3 70 1.4%
 
4 60 1.2%
 
5 52 1.0%
 
ValueCountFrequency (%) 
23000 1 < 0.1%
 
22000 8 0.2%
 
21000 10 0.2%
 
20000 1 < 0.1%
 
18000 4 0.1%
 

actor_3_facebook_likes
Real number (ℝ≥0)

ZEROS
Distinct count906
Unique (%)18.0%
Missing23
Missing (%)0.5%
Infinite0
Infinite (%)0.0%
Mean645.0097609561753
Minimum0.0
Maximum23000.0
Zeros89
Zeros (%)1.8%
Memory size39.5 KiB

Quantile statistics

Minimum0
5-th percentile10
Q1133
median371.5
Q3636
95-th percentile1000
Maximum23000
Range23000
Interquartile range (IQR)503

Descriptive statistics

Standard deviation1665.041728
Coefficient of variation (CV)2.581420979
Kurtosis60.56388811
Mean645.009761
Median Absolute Deviation (MAD)569.3467201
Skewness7.279020793
Sum3237949
Variance2772363.957
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000 126 2.5%
 
0 89 1.8%
 
11000 29 0.6%
 
3 28 0.6%
 
2000 27 0.5%
 
3000 26 0.5%
 
826 22 0.4%
 
2 21 0.4%
 
4 21 0.4%
 
7 21 0.4%
 
Other values (896) 4610 91.4%
 
(Missing) 23 0.5%
 
ValueCountFrequency (%) 
0 89 1.8%
 
2 21 0.4%
 
3 28 0.6%
 
4 21 0.4%
 
5 18 0.4%
 
ValueCountFrequency (%) 
23000 2 < 0.1%
 
20000 1 < 0.1%
 
19000 5 0.1%
 
17000 1 < 0.1%
 
16000 3 0.1%
 

actor_2_name
Categorical

HIGH CARDINALITY
Distinct count3032
Unique (%)60.3%
Missing13
Missing (%)0.3%
Memory size19.8 KiB
Morgan Freeman
 
20
Charlize Theron
 
15
Brad Pitt
 
14
Meryl Streep
 
11
James Franco
 
11
Other values (3027)
4959
ValueCountFrequency (%) 
Morgan Freeman 20 0.4%
 
Charlize Theron 15 0.3%
 
Brad Pitt 14 0.3%
 
Meryl Streep 11 0.2%
 
James Franco 11 0.2%
 
Adam Sandler 10 0.2%
 
Jason Flemyng 10 0.2%
 
Angelina Jolie Pitt 9 0.2%
 
Thomas Kretschmann 9 0.2%
 
Steve Buscemi 9 0.2%
 
Other values (3022) 4912 97.4%
 
(Missing) 13 0.3%
 

Length

Max length28
Mean length13.0483839
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 48 60.0%
 
Uppercase_Letter 26 32.5%
 
Decimal_Number 2 2.5%
 
Other_Punctuation 2 2.5%
 
Dash_Punctuation 1 1.2%
 
Space_Separator 1 1.2%
 
ValueCountFrequency (%) 
Latin 74 92.5%
 
Common 6 7.5%
 
ValueCountFrequency (%) 
ASCII 58 100.0%
 

actor_1_facebook_likes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count878
Unique (%)17.4%
Missing7
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean6560.04706115965
Minimum0.0
Maximum640000.0
Zeros26
Zeros (%)0.5%
Memory size39.5 KiB

Quantile statistics

Minimum0
5-th percentile95.5
Q1614
median988
Q311000
95-th percentile24000
Maximum640000
Range640000
Interquartile range (IQR)10386

Descriptive statistics

Standard deviation15020.75912
Coefficient of variation (CV)2.289733439
Kurtosis683.5473559
Mean6560.047061
Median Absolute Deviation (MAD)7727.675203
Skewness19.12177638
Sum33036397
Variance225623204.5
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000 449 8.9%
 
11000 211 4.2%
 
2000 197 3.9%
 
3000 155 3.1%
 
12000 135 2.7%
 
13000 127 2.5%
 
14000 123 2.4%
 
10000 112 2.2%
 
18000 109 2.2%
 
22000 82 1.6%
 
Other values (868) 3336 66.2%
 
ValueCountFrequency (%) 
0 26 0.5%
 
2 8 0.2%
 
3 4 0.1%
 
4 2 < 0.1%
 
5 7 0.1%
 
ValueCountFrequency (%) 
640000 1 < 0.1%
 
260000 3 0.1%
 
164000 2 < 0.1%
 
137000 2 < 0.1%
 
87000 8 0.2%
 

gross
Real number (ℝ≥0)

MISSING
Distinct count4035
Unique (%)97.0%
Missing884
Missing (%)17.5%
Infinite0
Infinite (%)0.0%
Mean48468407.52680933
Minimum162.0
Maximum760505847.0
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum162
5-th percentile99034
Q15340987.5
median25517500
Q362309437.5
95-th percentile180029729.4
Maximum760505847
Range760505685
Interquartile range (IQR)56968450

Descriptive statistics

Standard deviation68452990.44
Coefficient of variation (CV)1.412321839
Kurtosis14.86886885
Mean48468407.53
Median Absolute Deviation (MAD)45141337.64
Skewness3.127203838
Sum2.015801069e+11
Variance4.6858119e+15
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
144512310 3 0.1%
 
5773519 3 0.1%
 
177343675 3 0.1%
 
34964818 3 0.1%
 
47000000 3 0.1%
 
218051260 3 0.1%
 
3000000 3 0.1%
 
8000000 3 0.1%
 
800000 2 < 0.1%
 
22494487 2 < 0.1%
 
Other values (4025) 4131 81.9%
 
(Missing) 884 17.5%
 
ValueCountFrequency (%) 
162 1 < 0.1%
 
703 1 < 0.1%
 
721 1 < 0.1%
 
728 1 < 0.1%
 
828 1 < 0.1%
 
ValueCountFrequency (%) 
760505847 1 < 0.1%
 
658672302 1 < 0.1%
 
652177271 1 < 0.1%
 
623279547 2 < 0.1%
 
533316061 1 < 0.1%
 

genres
Categorical

HIGH CARDINALITY
Distinct count914
Unique (%)18.1%
Missing0
Missing (%)0.0%
Memory size19.8 KiB
Drama
 
236
Comedy
 
209
Comedy|Drama
 
191
Comedy|Drama|Romance
 
187
Comedy|Romance
 
158
Other values (909)
4062
ValueCountFrequency (%) 
Drama 236 4.7%
 
Comedy 209 4.1%
 
Comedy|Drama 191 3.8%
 
Comedy|Drama|Romance 187 3.7%
 
Comedy|Romance 158 3.1%
 
Drama|Romance 152 3.0%
 
Crime|Drama|Thriller 101 2.0%
 
Horror 71 1.4%
 
Action|Crime|Drama|Thriller 68 1.3%
 
Action|Crime|Thriller 65 1.3%
 
Other values (904) 3605 71.5%
 

Length

Max length64
Mean length20.31310728
Min length5
ValueCountFrequency (%) 
Lowercase_Letter 19 54.3%
 
Uppercase_Letter 14 40.0%
 
Dash_Punctuation 1 2.9%
 
Math_Symbol 1 2.9%
 
ValueCountFrequency (%) 
Latin 33 94.3%
 
Common 2 5.7%
 
ValueCountFrequency (%) 
ASCII 35 100.0%
 

actor_1_name
Categorical

HIGH CARDINALITY
Distinct count2097
Unique (%)41.6%
Missing7
Missing (%)0.1%
Memory size19.8 KiB
Robert De Niro
 
49
Johnny Depp
 
41
Nicolas Cage
 
33
J.K. Simmons
 
31
Bruce Willis
 
30
Other values (2092)
4852
ValueCountFrequency (%) 
Robert De Niro 49 1.0%
 
Johnny Depp 41 0.8%
 
Nicolas Cage 33 0.7%
 
J.K. Simmons 31 0.6%
 
Bruce Willis 30 0.6%
 
Matt Damon 30 0.6%
 
Denzel Washington 30 0.6%
 
Liam Neeson 29 0.6%
 
Steve Buscemi 27 0.5%
 
Robin Williams 27 0.5%
 
Other values (2087) 4709 93.4%
 

Length

Max length27
Mean length13.1782669
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 42 55.3%
 
Uppercase_Letter 28 36.8%
 
Decimal_Number 2 2.6%
 
Other_Punctuation 2 2.6%
 
Dash_Punctuation 1 1.3%
 
Space_Separator 1 1.3%
 
ValueCountFrequency (%) 
Latin 70 92.1%
 
Common 6 7.9%
 
ValueCountFrequency (%) 
ASCII 58 100.0%
 

movie_title
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count4917
Unique (%)97.5%
Missing0
Missing (%)0.0%
Memory size19.8 KiB
Pan 
 
3
Victor Frankenstein 
 
3
The Fast and the Furious 
 
3
Home 
 
3
Halloween 
 
3
Other values (4912)
5028
ValueCountFrequency (%) 
Pan  3 0.1%
 
Victor Frankenstein  3 0.1%
 
The Fast and the Furious  3 0.1%
 
Home  3 0.1%
 
Halloween  3 0.1%
 
King Kong  3 0.1%
 
Ben-Hur  3 0.1%
 
Godzilla Resurgence  2 < 0.1%
 
The Texas Chain Saw Massacre  2 < 0.1%
 
Dodgeball: A True Underdog Story  2 < 0.1%
 
Other values (4907) 5016 99.5%
 

Length

Max length87
Mean length16.54967281
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 35 36.1%
 
Uppercase_Letter 27 27.8%
 
Other_Punctuation 12 12.4%
 
Decimal_Number 10 10.3%
 
Open_Punctuation 2 2.1%
 
Space_Separator 2 2.1%
 
Currency_Symbol 2 2.1%
 
Close_Punctuation 2 2.1%
 
Dash_Punctuation 1 1.0%
 
Other_Symbol 1 1.0%
 
Other values (3) 3 3.1%
 
ValueCountFrequency (%) 
Latin 62 63.9%
 
Common 35 36.1%
 
ValueCountFrequency (%) 
ASCII 82 100.0%
 

num_voted_users
Real number (ℝ≥0)

Distinct count4826
Unique (%)95.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83668.16081697402
Minimum5
Maximum1689764
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum5
5-th percentile514.6
Q18593.5
median34359
Q396309
95-th percentile332254.9
Maximum1689764
Range1689759
Interquartile range (IQR)87715.5

Descriptive statistics

Standard deviation138485.2568
Coefficient of variation (CV)1.655172714
Kurtosis24.44552017
Mean83668.16082
Median Absolute Deviation (MAD)84252.04372
Skewness4.029871144
Sum421938535
Variance1.917816635e+10
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[5.000000e+00 1.240000e+02 6.095000e+02 1.669500e+03 6.382500e+03 ... 2.222510e+05 3.340960e+05 5.374305e+05 8.868355e+05 1.689764e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
57 5 0.1%
 
6 4 0.1%
 
6025 3 0.1%
 
374 3 0.1%
 
53 3 0.1%
 
3119 3 0.1%
 
62 3 0.1%
 
162 3 0.1%
 
2541 3 0.1%
 
8 3 0.1%
 
Other values (4816) 5010 99.3%
 
ValueCountFrequency (%) 
5 2 < 0.1%
 
6 4 0.1%
 
7 2 < 0.1%
 
8 3 0.1%
 
10 1 < 0.1%
 
ValueCountFrequency (%) 
1689764 1 < 0.1%
 
1676169 1 < 0.1%
 
1468200 1 < 0.1%
 
1347461 1 < 0.1%
 
1324680 1 < 0.1%
 

cast_total_facebook_likes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count3978
Unique (%)78.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9699.06385088241
Minimum0
Maximum656730
Zeros33
Zeros (%)0.7%
Memory size39.5 KiB

Quantile statistics

Minimum0
5-th percentile179
Q11411
median3090
Q313756.5
95-th percentile36927.7
Maximum656730
Range656730
Interquartile range (IQR)12345.5

Descriptive statistics

Standard deviation18163.79912
Coefficient of variation (CV)1.872737349
Kurtosis361.2551153
Mean9699.063851
Median Absolute Deviation (MAD)10152.51874
Skewness12.83192773
Sum48912379
Variance329923598.6
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000e+00 1.00000e+00 8.15000e+01 2.76450e+03 3.31150e+03 ... 5.40790e+04 6.46985e+04 9.22280e+04 1.55193e+05 6.56730e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 33 0.7%
 
5 7 0.1%
 
2020 6 0.1%
 
2 6 0.1%
 
1044 5 0.1%
 
673 5 0.1%
 
29 5 0.1%
 
2321 4 0.1%
 
1554 4 0.1%
 
646 4 0.1%
 
Other values (3968) 4964 98.4%
 
ValueCountFrequency (%) 
0 33 0.7%
 
2 6 0.1%
 
3 1 < 0.1%
 
4 2 < 0.1%
 
5 7 0.1%
 
ValueCountFrequency (%) 
656730 1 < 0.1%
 
303717 1 < 0.1%
 
283939 1 < 0.1%
 
263584 1 < 0.1%
 
261818 1 < 0.1%
 

actor_3_name
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count3521
Unique (%)70.1%
Missing23
Missing (%)0.5%
Memory size19.8 KiB
John Heard
 
8
Ben Mendelsohn
 
8
Steve Coogan
 
8
Kirsten Dunst
 
7
Sam Shepard
 
7
Other values (3516)
4982
ValueCountFrequency (%) 
John Heard 8 0.2%
 
Ben Mendelsohn 8 0.2%
 
Steve Coogan 8 0.2%
 
Kirsten Dunst 7 0.1%
 
Sam Shepard 7 0.1%
 
Anne Hathaway 7 0.1%
 
Jon Gries 7 0.1%
 
Stephen Root 7 0.1%
 
Lois Maxwell 7 0.1%
 
Robert Duvall 7 0.1%
 
Other values (3511) 4947 98.1%
 
(Missing) 23 0.5%
 

Length

Max length29
Mean length13.03628792
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 44 54.3%
 
Uppercase_Letter 31 38.3%
 
Decimal_Number 2 2.5%
 
Other_Punctuation 2 2.5%
 
Dash_Punctuation 1 1.2%
 
Space_Separator 1 1.2%
 
ValueCountFrequency (%) 
Latin 75 92.6%
 
Common 6 7.4%
 
ValueCountFrequency (%) 
ASCII 58 100.0%
 

facenumber_in_poster
Real number (ℝ≥0)

ZEROS
Distinct count19
Unique (%)0.4%
Missing13
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean1.3711729622266402
Minimum0.0
Maximum43.0
Zeros2152
Zeros (%)42.7%
Memory size39.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum43
Range43
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.01357592
Coefficient of variation (CV)1.468506144
Kurtosis52.03373533
Mean1.371172962
Median Absolute Deviation (MAD)1.357893277
Skewness4.384765939
Sum6897
Variance4.054487986
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 2152 42.7%
 
1 1251 24.8%
 
2 716 14.2%
 
3 380 7.5%
 
4 207 4.1%
 
5 114 2.3%
 
6 76 1.5%
 
7 48 1.0%
 
8 37 0.7%
 
9 18 0.4%
 
Other values (9) 31 0.6%
 
(Missing) 13 0.3%
 
ValueCountFrequency (%) 
0 2152 42.7%
 
1 1251 24.8%
 
2 716 14.2%
 
3 380 7.5%
 
4 207 4.1%
 
ValueCountFrequency (%) 
43 1 < 0.1%
 
31 1 < 0.1%
 
19 1 < 0.1%
 
15 6 0.1%
 
14 1 < 0.1%
 

plot_keywords
Categorical

HIGH CARDINALITY
MISSING
UNIFORM
Distinct count4760
Unique (%)97.3%
Missing153
Missing (%)3.0%
Memory size19.8 KiB
based on novel
 
4
animal name in title|ape abducts a woman|gorilla|island|king kong
 
3
1940s|child hero|fantasy world|orphan|reference to peter pan
 
3
alien friendship|alien invasion|australia|flying car|mother daughter relationship
 
3
halloween|masked killer|michael myers|slasher|trick or treat
 
3
Other values (4755)
4874
ValueCountFrequency (%) 
based on novel 4 0.1%
 
animal name in title|ape abducts a woman|gorilla|island|king kong 3 0.1%
 
1940s|child hero|fantasy world|orphan|reference to peter pan 3 0.1%
 
alien friendship|alien invasion|australia|flying car|mother daughter relationship 3 0.1%
 
halloween|masked killer|michael myers|slasher|trick or treat 3 0.1%
 
eighteen wheeler|illegal street racing|truck|trucker|undercover cop 3 0.1%
 
one word title 3 0.1%
 
assistant|experiment|frankenstein|medical student|scientist 3 0.1%
 
race relations|racism|racist|social problem|stereotype 2 < 0.1%
 
casino|espionage|free running|james bond|terrorist 2 < 0.1%
 
Other values (4750) 4861 96.4%
 
(Missing) 153 3.0%
 

Length

Max length149
Mean length50.93337299
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 26 61.9%
 
Decimal_Number 10 23.8%
 
Other_Punctuation 2 4.8%
 
Open_Punctuation 1 2.4%
 
Space_Separator 1 2.4%
 
Math_Symbol 1 2.4%
 
Close_Punctuation 1 2.4%
 
ValueCountFrequency (%) 
Latin 26 61.9%
 
Common 16 38.1%
 
ValueCountFrequency (%) 
ASCII 42 100.0%
 
Distinct count4919
Unique (%)97.5%
Missing0
Missing (%)0.0%
Memory size19.8 KiB
http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1
 
3
Other values (4914)
5028
ValueCountFrequency (%) 
http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt2638144/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt0083722/?ref_=fn_tt_tt_1 2 < 0.1%
 
http://www.imdb.com/title/tt0844708/?ref_=fn_tt_tt_1 2 < 0.1%
 
http://www.imdb.com/title/tt1666335/?ref_=fn_tt_tt_1 2 < 0.1%
 
Other values (4909) 5016 99.5%
 
ValueCountFrequency (%) 
http 5043 100.0%
 
ValueCountFrequency (%) 
www.imdb.com 5043 100.0%
 
ValueCountFrequency (%) 
/title/tt3332064/ 3 0.1%
 
/title/tt2638144/ 3 0.1%
 
/title/tt0232500/ 3 0.1%
 
/title/tt2224026/ 3 0.1%
 
/title/tt1976009/ 3 0.1%
 
/title/tt0077651/ 3 0.1%
 
/title/tt0360717/ 3 0.1%
 
/title/tt0844708/ 2 < 0.1%
 
/title/tt0138304/ 2 < 0.1%
 
/title/tt0795368/ 2 < 0.1%
 
Other values (4909) 5016 99.5%
 
ValueCountFrequency (%) 
ref_=fn_tt_tt_1 5043 100.0%
 
ValueCountFrequency (%) 
5043 100.0%
 

num_user_for_reviews
Real number (ℝ≥0)

Distinct count954
Unique (%)19.0%
Missing21
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean272.77080844285143
Minimum1.0
Maximum5060.0
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum1
5-th percentile10
Q165
median156
Q3326
95-th percentile907.8
Maximum5060
Range5059
Interquartile range (IQR)261

Descriptive statistics

Standard deviation377.9828856
Coefficient of variation (CV)1.385716044
Kurtosis26.43829739
Mean272.7708084
Median Absolute Deviation (MAD)228.8571855
Skewness4.121475159
Sum1369855
Variance142871.0618
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 51 1.0%
 
3 33 0.7%
 
26 32 0.6%
 
2 32 0.6%
 
10 29 0.6%
 
6 28 0.6%
 
50 26 0.5%
 
32 25 0.5%
 
8 25 0.5%
 
31 24 0.5%
 
Other values (944) 4717 93.5%
 
ValueCountFrequency (%) 
1 51 1.0%
 
2 32 0.6%
 
3 33 0.7%
 
4 23 0.5%
 
5 19 0.4%
 
ValueCountFrequency (%) 
5060 1 < 0.1%
 
4667 1 < 0.1%
 
4144 1 < 0.1%
 
3646 1 < 0.1%
 
3597 1 < 0.1%
 

language
Categorical

Distinct count47
Unique (%)0.9%
Missing12
Missing (%)0.2%
Memory size19.8 KiB
English
4704
French
 
73
Spanish
 
40
Hindi
 
28
Mandarin
 
26
Other values (42)
 
160
ValueCountFrequency (%) 
English 4704 93.3%
 
French 73 1.4%
 
Spanish 40 0.8%
 
Hindi 28 0.6%
 
Mandarin 26 0.5%
 
German 19 0.4%
 
Japanese 18 0.4%
 
Italian 11 0.2%
 
Cantonese 11 0.2%
 
Russian 11 0.2%
 
Other values (37) 90 1.8%
 
(Missing) 12 0.2%
 

Length

Max length10
Mean length6.971247273
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 23 53.5%
 
Uppercase_Letter 20 46.5%
 
ValueCountFrequency (%) 
Latin 43 100.0%
 
ValueCountFrequency (%) 
ASCII 43 100.0%
 

country
Categorical

HIGH CARDINALITY
Distinct count65
Unique (%)1.3%
Missing5
Missing (%)0.1%
Memory size19.8 KiB
USA
3807
UK
 
448
France
 
154
Canada
 
126
Germany
 
97
Other values (60)
 
406
ValueCountFrequency (%) 
USA 3807 75.5%
 
UK 448 8.9%
 
France 154 3.1%
 
Canada 126 2.5%
 
Germany 97 1.9%
 
Australia 55 1.1%
 
India 34 0.7%
 
Spain 33 0.7%
 
China 30 0.6%
 
Japan 23 0.5%
 
Other values (55) 231 4.6%
 

Length

Max length20
Mean length3.488796351
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 24 51.1%
 
Uppercase_Letter 22 46.8%
 
Space_Separator 1 2.1%
 
ValueCountFrequency (%) 
Latin 46 97.9%
 
Common 1 2.1%
 
ValueCountFrequency (%) 
ASCII 47 100.0%
 

content_rating
Categorical

MISSING
Distinct count18
Unique (%)0.4%
Missing303
Missing (%)6.0%
Memory size19.8 KiB
R
2118
PG-13
1461
PG
701
Not Rated
 
116
G
 
112
Other values (13)
 
232
ValueCountFrequency (%) 
R 2118 42.0%
 
PG-13 1461 29.0%
 
PG 701 13.9%
 
Not Rated 116 2.3%
 
G 112 2.2%
 
Unrated 62 1.2%
 
Approved 55 1.1%
 
TV-14 30 0.6%
 
TV-MA 20 0.4%
 
X 13 0.3%
 
Other values (8) 52 1.0%
 
(Missing) 303 6.0%
 

Length

Max length9
Mean length2.825104105
Min length1
ValueCountFrequency (%) 
Uppercase_Letter 12 42.9%
 
Lowercase_Letter 10 35.7%
 
Decimal_Number 4 14.3%
 
Dash_Punctuation 1 3.6%
 
Space_Separator 1 3.6%
 
ValueCountFrequency (%) 
Latin 22 78.6%
 
Common 6 21.4%
 
ValueCountFrequency (%) 
ASCII 28 100.0%
 

budget
Real number (ℝ≥0)

MISSING
SKEWED
Distinct count439
Unique (%)9.6%
Missing492
Missing (%)9.8%
Infinite0
Infinite (%)0.0%
Mean39752620.436387606
Minimum218.0
Maximum12215500000.0
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum218
5-th percentile500000
Q16000000
median20000000
Q345000000
95-th percentile130000000
Maximum1.22155e+10
Range1.221549978e+10
Interquartile range (IQR)39000000

Descriptive statistics

Standard deviation206114898.4
Coefficient of variation (CV)5.184938658
Kurtosis2724.257433
Mean39752620.44
Median Absolute Deviation (MAD)37695559.05
Skewness48.15743539
Sum1.809141756e+11
Variance4.248335136e+16
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
20000000 174 3.5%
 
15000000 143 2.8%
 
25000000 142 2.8%
 
30000000 141 2.8%
 
10000000 135 2.7%
 
40000000 131 2.6%
 
35000000 120 2.4%
 
5000000 111 2.2%
 
50000000 101 2.0%
 
60000000 92 1.8%
 
Other values (429) 3261 64.7%
 
(Missing) 492 9.8%
 
ValueCountFrequency (%) 
218 1 < 0.1%
 
1100 1 < 0.1%
 
1400 1 < 0.1%
 
3250 1 < 0.1%
 
4500 1 < 0.1%
 
ValueCountFrequency (%) 
1.22155e+10 1 < 0.1%
 
4200000000 1 < 0.1%
 
2500000000 1 < 0.1%
 
2400000000 1 < 0.1%
 
2127519898 1 < 0.1%
 

title_year
Real number (ℝ≥0)

MISSING
Distinct count91
Unique (%)1.8%
Missing108
Missing (%)2.1%
Infinite0
Infinite (%)0.0%
Mean2002.4705167173252
Minimum1916.0
Maximum2016.0
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum1916
5-th percentile1979
Q11999
median2005
Q32011
95-th percentile2015
Maximum2016
Range100
Interquartile range (IQR)12

Descriptive statistics

Standard deviation12.47459892
Coefficient of variation (CV)0.006229604289
Kurtosis7.439212616
Mean2002.470517
Median Absolute Deviation (MAD)8.554733481
Skewness-2.29227335
Sum9882192
Variance155.6156182
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2009 260 5.2%
 
2014 252 5.0%
 
2006 239 4.7%
 
2013 237 4.7%
 
2010 230 4.6%
 
2015 226 4.5%
 
2008 225 4.5%
 
2011 225 4.5%
 
2005 221 4.4%
 
2012 221 4.4%
 
Other values (81) 2599 51.5%
 
ValueCountFrequency (%) 
1916 1 < 0.1%
 
1920 1 < 0.1%
 
1925 1 < 0.1%
 
1927 1 < 0.1%
 
1929 2 < 0.1%
 
ValueCountFrequency (%) 
2016 106 2.1%
 
2015 226 4.5%
 
2014 252 5.0%
 
2013 237 4.7%
 
2012 221 4.4%
 

actor_2_facebook_likes
Real number (ℝ≥0)

ZEROS
Distinct count917
Unique (%)18.2%
Missing13
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean1651.7544731610337
Minimum0.0
Maximum137000.0
Zeros55
Zeros (%)1.1%
Memory size39.5 KiB

Quantile statistics

Minimum0
5-th percentile26
Q1281
median595
Q3918
95-th percentile11000
Maximum137000
Range137000
Interquartile range (IQR)637

Descriptive statistics

Standard deviation4042.438863
Coefficient of variation (CV)2.447360627
Kurtosis256.7951889
Mean1651.754473
Median Absolute Deviation (MAD)1979.395883
Skewness9.884733179
Sum8308325
Variance16341311.96
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000 309 6.1%
 
11000 111 2.2%
 
2000 100 2.0%
 
3000 76 1.5%
 
0 55 1.1%
 
10000 47 0.9%
 
14000 41 0.8%
 
13000 40 0.8%
 
826 37 0.7%
 
4000 34 0.7%
 
Other values (907) 4180 82.9%
 
ValueCountFrequency (%) 
0 55 1.1%
 
2 14 0.3%
 
3 14 0.3%
 
4 12 0.2%
 
5 10 0.2%
 
ValueCountFrequency (%) 
137000 1 < 0.1%
 
29000 1 < 0.1%
 
27000 2 < 0.1%
 
25000 3 0.1%
 
23000 6 0.1%
 

imdb_score
Real number (ℝ≥0)

Distinct count78
Unique (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.442137616498116
Minimum1.6
Maximum9.5
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum1.6
5-th percentile4.4
Q15.8
median6.6
Q37.2
95-th percentile8.09
Maximum9.5
Range7.9
Interquartile range (IQR)1.4

Descriptive statistics

Standard deviation1.125115866
Coefficient of variation (CV)0.1746494615
Kurtosis0.9356915064
Mean6.442137616
Median Absolute Deviation (MAD)0.8730186468
Skewness-0.7414713363
Sum32487.7
Variance1.265885711
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.6 2.65 3.25 4.05 4.75 ... 7.85 8.15 8.55 8.85 9.5 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6.7 223 4.4%
 
6.6 201 4.0%
 
7.2 195 3.9%
 
6.5 186 3.7%
 
6.4 185 3.7%
 
7.3 184 3.6%
 
7 184 3.6%
 
7.1 181 3.6%
 
6.8 181 3.6%
 
6.1 179 3.5%
 
Other values (68) 3144 62.3%
 
ValueCountFrequency (%) 
1.6 1 < 0.1%
 
1.7 1 < 0.1%
 
1.9 3 0.1%
 
2 2 < 0.1%
 
2.1 3 0.1%
 
ValueCountFrequency (%) 
9.5 1 < 0.1%
 
9.3 1 < 0.1%
 
9.2 1 < 0.1%
 
9.1 3 0.1%
 
9 3 0.1%
 

aspect_ratio
Real number (ℝ≥0)

MISSING
Distinct count22
Unique (%)0.5%
Missing329
Missing (%)6.5%
Infinite0
Infinite (%)0.0%
Mean2.22040305473059
Minimum1.18
Maximum16.0
Zeros0
Zeros (%)0.0%
Memory size39.5 KiB

Quantile statistics

Minimum1.18
5-th percentile1.66
Q11.85
median2.35
Q32.35
95-th percentile2.35
Maximum16
Range14.82
Interquartile range (IQR)0.5

Descriptive statistics

Standard deviation1.385112535
Coefficient of variation (CV)0.6238113087
Kurtosis90.65322055
Mean2.220403055
Median Absolute Deviation (MAD)0.4004107589
Skewness9.390056312
Sum10466.98
Variance1.918536735
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2.35 2360 46.8%
 
1.85 1906 37.8%
 
1.78 110 2.2%
 
1.37 100 2.0%
 
1.33 68 1.3%
 
1.66 64 1.3%
 
16 45 0.9%
 
2.2 15 0.3%
 
2.39 15 0.3%
 
4 7 0.1%
 
Other values (12) 24 0.5%
 
(Missing) 329 6.5%
 
ValueCountFrequency (%) 
1.18 1 < 0.1%
 
1.2 1 < 0.1%
 
1.33 68 1.3%
 
1.37 100 2.0%
 
1.44 1 < 0.1%
 
ValueCountFrequency (%) 
16 45 0.9%
 
4 7 0.1%
 
2.76 3 0.1%
 
2.55 2 < 0.1%
 
2.4 3 0.1%
 

movie_facebook_likes
Real number (ℝ≥0)

ZEROS
Distinct count876
Unique (%)17.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7525.9645052548085
Minimum0
Maximum349000
Zeros2181
Zeros (%)43.2%
Memory size39.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median166
Q33000
95-th percentile40000
Maximum349000
Range349000
Interquartile range (IQR)3000

Descriptive statistics

Standard deviation19320.44511
Coefficient of variation (CV)2.567171968
Kurtosis41.33443692
Mean7525.964505
Median Absolute Deviation (MAD)11022.02801
Skewness5.05892689
Sum37953439
Variance373279599.2
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 1.000e+00 9.250e+01 4.915e+02 9.995e+02 ... 6.550e+04 8.400e+04 1.235e+05 1.980e+05 3.490e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 2181 43.2%
 
1000 109 2.2%
 
11000 83 1.6%
 
10000 81 1.6%
 
12000 62 1.2%
 
13000 58 1.2%
 
2000 56 1.1%
 
15000 53 1.1%
 
14000 50 1.0%
 
16000 47 0.9%
 
Other values (866) 2263 44.9%
 
ValueCountFrequency (%) 
0 2181 43.2%
 
2 2 < 0.1%
 
3 1 < 0.1%
 
4 5 0.1%
 
5 2 < 0.1%
 
ValueCountFrequency (%) 
349000 1 < 0.1%
 
199000 1 < 0.1%
 
197000 1 < 0.1%
 
191000 1 < 0.1%
 
190000 1 < 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

colordirector_namenum_critic_for_reviewsdurationdirector_facebook_likesactor_3_facebook_likesactor_2_nameactor_1_facebook_likesgrossgenresactor_1_namemovie_titlenum_voted_userscast_total_facebook_likesactor_3_namefacenumber_in_posterplot_keywordsmovie_imdb_linknum_user_for_reviewslanguagecountrycontent_ratingbudgettitle_yearactor_2_facebook_likesimdb_scoreaspect_ratiomovie_facebook_likes
0ColorJames Cameron723.0178.00.0855.0Joel David Moore1000.0760505847.0Action|Adventure|Fantasy|Sci-FiCCH PounderAvatar8862044834Wes Studi0.0avatar|future|marine|native|paraplegichttp://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_13054.0EnglishUSAPG-13237000000.02009.0936.07.91.7833000
1ColorGore Verbinski302.0169.0563.01000.0Orlando Bloom40000.0309404152.0Action|Adventure|FantasyJohnny DeppPirates of the Caribbean: At World's End47122048350Jack Davenport0.0goddess|marriage ceremony|marriage proposal|pirate|singaporehttp://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_11238.0EnglishUSAPG-13300000000.02007.05000.07.12.350
2ColorSam Mendes602.0148.00.0161.0Rory Kinnear11000.0200074175.0Action|Adventure|ThrillerChristoph WaltzSpectre27586811700Stephanie Sigman1.0bomb|espionage|sequel|spy|terroristhttp://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1994.0EnglishUKPG-13245000000.02015.0393.06.82.3585000
3ColorChristopher Nolan813.0164.022000.023000.0Christian Bale27000.0448130642.0Action|ThrillerTom HardyThe Dark Knight Rises1144337106759Joseph Gordon-Levitt0.0deception|imprisonment|lawlessness|police officer|terrorist plothttp://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_12701.0EnglishUSAPG-13250000000.02012.023000.08.52.35164000
4NaNDoug WalkerNaNNaN131.0NaNRob Walker131.0NaNDocumentaryDoug WalkerStar Wars: Episode VII - The Force Awakens8143NaN0.0NaNhttp://www.imdb.com/title/tt5289954/?ref_=fn_tt_tt_1NaNNaNNaNNaNNaNNaN12.07.1NaN0
5ColorAndrew Stanton462.0132.0475.0530.0Samantha Morton640.073058679.0Action|Adventure|Sci-FiDaryl SabaraJohn Carter2122041873Polly Walker1.0alien|american civil war|male nipple|mars|princesshttp://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1738.0EnglishUSAPG-13263700000.02012.0632.06.62.3524000
6ColorSam Raimi392.0156.00.04000.0James Franco24000.0336530303.0Action|Adventure|RomanceJ.K. SimmonsSpider-Man 338305646055Kirsten Dunst0.0sandman|spider man|symbiote|venom|villainhttp://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_11902.0EnglishUSAPG-13258000000.02007.011000.06.22.350
7ColorNathan Greno324.0100.015.0284.0Donna Murphy799.0200807262.0Adventure|Animation|Comedy|Family|Fantasy|Musical|RomanceBrad GarrettTangled2948102036M.C. Gainey1.017th century|based on fairy tale|disney|flower|towerhttp://www.imdb.com/title/tt0398286/?ref_=fn_tt_tt_1387.0EnglishUSAPG260000000.02010.0553.07.81.8529000
8ColorJoss Whedon635.0141.00.019000.0Robert Downey Jr.26000.0458991599.0Action|Adventure|Sci-FiChris HemsworthAvengers: Age of Ultron46266992000Scarlett Johansson4.0artificial intelligence|based on comic book|captain america|marvel cinematic universe|superherohttp://www.imdb.com/title/tt2395427/?ref_=fn_tt_tt_11117.0EnglishUSAPG-13250000000.02015.021000.07.52.35118000
9ColorDavid Yates375.0153.0282.010000.0Daniel Radcliffe25000.0301956980.0Adventure|Family|Fantasy|MysteryAlan RickmanHarry Potter and the Half-Blood Prince32179558753Rupert Grint3.0blood|book|love|potion|professorhttp://www.imdb.com/title/tt0417741/?ref_=fn_tt_tt_1973.0EnglishUKPG250000000.02009.011000.07.52.3510000

Last rows

colordirector_namenum_critic_for_reviewsdurationdirector_facebook_likesactor_3_facebook_likesactor_2_nameactor_1_facebook_likesgrossgenresactor_1_namemovie_titlenum_voted_userscast_total_facebook_likesactor_3_namefacenumber_in_posterplot_keywordsmovie_imdb_linknum_user_for_reviewslanguagecountrycontent_ratingbudgettitle_yearactor_2_facebook_likesimdb_scoreaspect_ratiomovie_facebook_likes
5033ColorShane Carruth143.077.0291.08.0David Sullivan291.0424760.0Drama|Sci-Fi|ThrillerShane CarruthPrimer72639368Casey Gooden0.0changing the future|independent film|invention|nonlinear timeline|time travelhttp://www.imdb.com/title/tt0390384/?ref_=fn_tt_tt_1371.0EnglishUSAPG-137000.02004.045.07.01.8519000
5034ColorNeill Dela Llana35.080.00.00.0Edgar Tancangco0.070071.0ThrillerIan GamazonCavite5890Quynn Ton0.0jihad|mindanao|philippines|security guard|squatterhttp://www.imdb.com/title/tt0428303/?ref_=fn_tt_tt_135.0EnglishPhilippinesNot Rated7000.02005.00.06.3NaN74
5035ColorRobert Rodriguez56.081.00.06.0Peter Marquardt121.02040920.0Action|Crime|Drama|Romance|ThrillerCarlos GallardoEl Mariachi52055147Consuelo Gómez0.0assassin|death|guitar|gun|mariachihttp://www.imdb.com/title/tt0104815/?ref_=fn_tt_tt_1130.0SpanishUSAR7000.01992.020.06.91.370
5036ColorAnthony ValloneNaN84.02.02.0John Considine45.0NaNCrime|DramaRichard JewellThe Mongol King3693Sara Stepnicka0.0jewell|mongol|nostradamus|stepnicka|vallonehttp://www.imdb.com/title/tt0430371/?ref_=fn_tt_tt_11.0EnglishUSAPG-133250.02005.044.07.8NaN4
5037ColorEdward Burns14.095.00.0133.0Caitlin FitzGerald296.04584.0Comedy|DramaKerry BishéNewlyweds1338690Daniella Pineda1.0written and directed by cast memberhttp://www.imdb.com/title/tt1880418/?ref_=fn_tt_tt_114.0EnglishUSANot Rated9000.02011.0205.06.4NaN413
5038ColorScott Smith1.087.02.0318.0Daphne Zuniga637.0NaNComedy|DramaEric MabiusSigned Sealed Delivered6292283Crystal Lowe2.0fraud|postal worker|prison|theft|trialhttp://www.imdb.com/title/tt3000844/?ref_=fn_tt_tt_16.0EnglishCanadaNaNNaN2013.0470.07.7NaN84
5039ColorNaN43.043.0NaN319.0Valorie Curry841.0NaNCrime|Drama|Mystery|ThrillerNatalie ZeaThe Following738391753Sam Underwood1.0cult|fbi|hideout|prison escape|serial killerhttp://www.imdb.com/title/tt2071645/?ref_=fn_tt_tt_1359.0EnglishUSATV-14NaNNaN593.07.516.0032000
5040ColorBenjamin Roberds13.076.00.00.0Maxwell Moody0.0NaNDrama|Horror|ThrillerEva BoehnkeA Plague So Pleasant380David Chandler0.0NaNhttp://www.imdb.com/title/tt2107644/?ref_=fn_tt_tt_13.0EnglishUSANaN1400.02013.00.06.3NaN16
5041ColorDaniel Hsia14.0100.00.0489.0Daniel Henney946.010443.0Comedy|Drama|RomanceAlan RuckShanghai Calling12552386Eliza Coupe5.0NaNhttp://www.imdb.com/title/tt2070597/?ref_=fn_tt_tt_19.0EnglishUSAPG-13NaN2012.0719.06.32.35660
5042ColorJon Gunn43.090.016.016.0Brian Herzlinger86.085222.0DocumentaryJohn AugustMy Date with Drew4285163Jon Gunn0.0actress name in title|crush|date|four word title|video camerahttp://www.imdb.com/title/tt0378407/?ref_=fn_tt_tt_184.0EnglishUSAPG1100.02004.023.06.61.85456